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Title: An in vitro ret hod for construction of a TO library 



FIELD OF THE INVENTION 

The present invention relates to optimizing DMA sequences 
in order to (a) improve the properties of a protein of interest 
bv artificial generation of genetic diversity of a gene encoding 
the protein of interest by the use of the so-called gene- or DN'A 
shuffling technique to create a large library of "genes", ex- 
pressing said library of genes in a suitable expression system 
and screening the expressed proteins in respect of specific char- 
acteristics to determine such croteins exhibiting desired proper- 
ties or (b) irr.orove the properties of regulatory elements such as 

aeration of a library of these ele- 
cts therewith in operable conjunc- 
: c r e s s i n g said structural gene and 



>rs rr, 



promoters or term. ma: 
rents, transforming : 
tier, with a structural gene, 
screenmc for desirable rrcre: 



i h e r e c 



:latorv element . 
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In today's society the conditions of life are vastly re- 
moved from the natural environment and it has been found that the 
naturally occurring bioactive molecules are not optimized for the 
various uses to which they are put by mankind, especially when 
they are used for industrial purposes. 

It has therefore been of interest to industry to identify 
such bioactive proteins that exhibit optimal properties in re- 
spect of the use to which it is intended. 

This has for many years been done by screening of natural 
sources, or by use of mutagenesis. For instance, within the tech- 
nical field of enzym.es for use in e.g. detergents, the washing 
ar.d/or dishwashing performance . of e.g. naturally occurring pro- 
teases, lipases, amylases and cellulases have been improved sig- 
nificantly, by 'in vitro modifications of the enzymes. 

In most cases these improvements have been obtained by 
site-directed mutagenesis resulting in substitution, deletion cr 
insertion of specific ammo acid residues which have been chosen 
either on the basis of their type or on the basis of their loca- 
tion m the secondarv or tertiary structure of the mature enzyme 



or instance US oaten t no. 4 , 5 1 & , 5 S 4 ) . 

r. •:• p r e p a : 

modified er.zym.es with altered 



this manner the oreoaration of novel polypeptide vari- 



ant; and mutants, sue:, as : 
characteristics, e.c. specific activity, substrate specificity 
thermal, pH ana salt stability, pH-optimum, pi, K^, etc. 
has successfully been performed to obtain polypeptides with Lt 

c-rcved properties . 

For instance, within the technical field of enzym.es t: 

washing and/cr dishwashing performance of e.g. proteases, i: 

rases, amvlasee and ceilulaser nave been, improved significantly. 
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An alternative general approach for modifying proteins and 
enzymes has been based on random mutagenesis, for instance, as 
disclosed in US 4,894,331 and WO 93/01285 

As it is a cumbersome and time consuming process to obtain 
5 polypeptide variants or mutants with improved functional prop- 
erties a few alternative methods for rapid preparation of modi- 
fied polypeptides have been suggested. 

Weber et al., ( 1933), Nucleic Acids Research, vol. 11, 
5651, describes a method for modifying genes by in vivo reccm- 
10 bination between two homologous genes. A linear DNA sequence com- 
prising a plasmid vector flanked by a DNA sequence encoding al- 
pha-l human interferon in the 5 ' -end and a DNA sequence encoding 
alpha-2 human interferon in the 3 ' -end is constructed and trans - 
fee ted into a rec A positive strain of E. coli. Recombinants were 
15 identified and isolated using a resistance marker. 

Pompon et al., (1959,, Gene 83, p. 15-24, describes a 
method for shuffling gene c:~ams of mammalian cytochrome P-4 50 
by in vivo recombination c: partially homologous sequences m 
5a zzna rczyces cerev: suae by transforming Saccha ror.yces cerevisiae 
21 with a linearized plasma with fiiied-m ends, and a DNA fragment 
being partially homoiaccar to the encs of said plasmid. 

In y.Z) 97/C72CE a rethed is described whereby polypeptide 
variants are preparea by shuffling different nucleotide sequences 
■_ : homologous DNA sequences by ir. vivo recombination using plas- 
2 5 midic DNA as template. 

U5 patent no. 5,093,25" (Assignee: Genencor Int. Inc.) dis- 
closes a method for producing hybrid polypeptides by in vivo re- 
ccmbination. Hybrid DNA sequences are produced by forming a cir- 
cular vector comprising a re: cation, sequence, a first DNA se- 
al 1 auer.ee encoding the amino- te: a . nal portion of the hybrid poly- 
peptide, a second DNA sequence ehacdmg the carboxy-termmal por- 
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tion of said hybrid polypeptide. The circular vector is trans- 
formed into a rec positive microorganism in which the circular 
vector is amplified. This results in recombination of said circu- 
lar vector mediated by the naturally occurring reccnbinaticn 
5 mechanism of the rec positive microorganism, which include pre- 
karyotes such as Bacillus and E. coli, and eukaryotes such as 
Saccharomyces cerevislae . 

One method for the shuffling of homologous DNA sequences 
has been described by Stemmer (Stemmer, (1994), Proc. Natl. Acas. 
10 Sci. USA, Vol. 91, 10747-10751 ; Stermer, (1994), Nature, vol. 
370, 389- 391). The method concerns shuffling homologous DNA se- 
quences by using in vitro ?CR techniques. Positive recombinant 
genes containing shuffled DNA sequences are selected from a D!CA 
library based on the improved function of the expressed proteins. 
15 The above method is als: described in WO 95/22 625. KO 

95/22625 relates to a method ::: snuffling of homologous DNA se- 
quences. An important step in tr.e method described in v;o 95/22625 
is to cleave the homologous template double-stranded polynucleo- 
tide into random fragments of a desired size followed by homoic- 
:C c:u5lv reassembling of the fragments into full-length genes. 

;^r.^ r .i to the method of WO 95/22 62 5 is, 



d i s a c v a n t a g e _ . ; . . ^ : - . . - - - - 
tnat the diver sit'.' cene rated throu* 
to tne use of homologous gene sequences (as defined m 



:t method is 1 



25 Another disadvantage m tne method of WO 95/22625 lies m 

tne oroduction of the random fragments by the cleavage of the 
temolate double-stranded polynu o leotice . 

A further reference of interest is WO 95/17413 describing a 
method of cene or DNA shuffling cy r eocmomat ic 
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PGR generated sequences to produce so-called functional elements 
(FE) comprising at least two of the design elements. According 
to the method described in WO 95/17413 the recombination has to 
be performed among design elements that have DNA sequences with 
5 sufficient sequence homology to enable hybridization of the dif- 
ferent sequences to be recombined. 

WO 95/17413 therefore also entails the disadvantage that 
the diversity generated is relatively limited. Furthermore the 
method described is time consuming, expensive, and not suited for 

10 automat isation . 

Despite the existence of the above methods there is still a 
need for better iterative in vitro recombination methods for pre- 
paring novel polypeptide variants. Such methods should also be 
capable of being performed in small volumes, and amenable to 

15 automatisation. 

summary or th£ invention 

T^e present invention concerns briefly the utilization of 
te-olate shift of a newly synthesized ON.-, strand during in vitro 
2: synthesis in order to achieve DNA shuffling. 3y using this 

technique it is possible :: cotam such results in a more expedi- 
ent manner, anc to seme extent even a greater variation than ir. 
the above mentioned methods. 

Tne method of the invention is also very well suited for 

7.5 adaption to automatisation. 

Ir. a preferred embodiment the technique is used in combma- 
ticr. with an error-prone polymerase thereby introducing an even 

greater variation in the library created. 

— vent lor. relates to a method 
More soec 1 1 1 ca i _ > ^h- _...c.n--~-. 



-eccmbinec homologous 

;ru^ -*u.. ^ - - — ~ - - - j- — 

polynucleotides fro::, a r.urrer 



:he construction of i ^icrary 

- of different starting sincl 
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double stranded parental DNA templates and primers by induce; 
template shifts during an xr. vitro polynucleotide synthesis usir.r 
a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

5 a) denaturing parental double stranded DNA templates 

produce single stranded templates, 

b) annealing said primers to the single stranded DNA ten- 
plates , 

c) extending said primers by initiating synthesis by use zz 
10 said polymerase, 

d) cause arrest of the synthesis, and 

e) denaturing the double, strand to separate the extended 
orimers from the templates, 

B. a template' shift is induced by 

15 a) isolating the newly synthesized single strandeo extended 

o^T^rs from the templates and repeating steps A.b) to 
A.e) using said extended primers produced in (A) as both 
primers and templates, or 

b) reseating steps A.b; to A.e), 

„^ ~ c ; ^ = - o-; p-'te" an aooroonate number 
2Q c. the aoove process _s ^e o ... 

A. and B.b! , or ctr- 



a: 



.a 



c: cycles of process steps 

citations thereof, ar.c 

^■^ r- ^ ^ ~ . , ,~ o - n ^M-^-iP 0 ::ce5 are amoiified in a 
optionally tne pr^ — ^- p^**..-- ^ — 

-eaction with specific primers to selectively 



s tar.darc 



am.o 



lifv homologous polynucleotides o; 



interest 



In soecific embodimen 
.r. the process of the ir.ver 
:o aoply a defective polyme 
— ^--odu~e mutations m ; 



various modifications can oe mace 
r 0 - exam.oie it is advantage: us 
e either an error-prone polymerase 
- r i s o n to tne templates, or a t z - 
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lymerase that will discontinue the polynucleotide synthesis pre- 
maturely to effect the arrest of the reaction. 

Further modifications will be described below. 
In a further aspect the invention relates to a method of 
5 identifying polypeptides exhibiting improved properties in com- 
parison to naturally occurring polypeptides of the same bioactiv- 
ity, whereby a library of recombined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then transformed into a suitable host system, 
10 to be expressed into the corresponding polypeptides and dis- 
played, said polypeptides are then screened in a suitable assay, 
and positive results selected. . 

In a still further aspect the invention relates to a method 
for producing a' polypeptide of interest as identified in the pre- 
15 ceding process, whereby a vector comprising a polynucleotide en- 
coding said polypeptide is transformed into a suitable host, said 
host is grown to express saic polypeptide, and the polypeptide 
recovered and purified. 

Finally, in further final aspects the invention 
20 relates to a recorrJ3ir.ee/shuffied protein, which is obtainable by 
anv of the methods according to the invention, and which is a 
recomJoined/shuffled protein comprising the sequences disclosed 

herein {vice infra). 

In those final aspects of the invention, the term 
25 ^obtainable'' denotes that said protein is preferable obtained by 
a method according to the invention. However a prior art known 
recombination/shuffling technique such as those described in V.'O 
95/22 625 or WO 95/17413 may be used too, either alone or m 
combination with a method according to the invention, in order to 
30 obtain saic recombined protein. 

Accordingly, further final aspect of the invention are; 
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a r e c o mb i n e d / s h u f f 1 e d protease obtainable by any of the 
methods according tc the invention, and comprising a recombined 
sequence, which at least contain two different partial sequences 
from at least two different wild-type proteases; 
5 a recombined/shuffied lipase obtainable by any of the 

methods according to the invention, and comprising a recombinec 
sequence, which at least contain two different partial wild-type 
sequence s 

a recombined/shuffied Pseudomonas lipases obtainable by any 
10 of the methods according to the invention, and comprising a 
recombinec sequence, which at least contain two different partial 
sequences from at least two of the different wild-type 
Fseudomonas lipases; 

a recombined/shuf f led xylanase obtainable by any of the 
15 methods according to the invention, and comprising a recombinec 
sequence, which at least contain two different partial sequences 
- rc - az least two different wild-type xylanases; 

a recombined/shuf f led collaiase obtainable by any of tr.e 

.-ention, and comprising a :e:c"JDir.e: 
tain two different partial sequences 
■ilc-tvpe ceiiulases; 

amvlase obtainable by any of the 
/ention, and comprising a reccmbined 
50 - u . 2r . CC/ which at least contain two different partial sequences 
from, at least two different wild-type amylases 

reccmbined/shuf f led laccase obtainable by any of the 
mothohn according tc the invent ion, and comprising a recombinea 
:;ecucnae, which at least ::rn..;:. two different partial sequences 
f*o- a." least two different ■.. :.a- 1 ype la ceases; 

n rocomcinea/shu: flea : n/tase obtainable by any o: the 
- - — - - o ~'r^ - n a an and corner i sing a recombinec 



:o 



.east two c i : : e r ■_■ n 



Cl w O J - ^ - 
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sequence, which at lease contain two different partial sequences 
from at least two different wild-type phytases. 



5 DEFINITIONS 

Prior to discussing this invention in further detail, the 
following terms will first be defined. 

"Shuffling": The term "shuffling" means recombination of 
nucleotide sequence fragment (s) between two or more homologous 
10 polynucleotides resulting in output polynucleotides (i.e. polynu- 
cleotides having been subjected to a shuffling cycle) having a 
number of nucleotide fragments exchanged, in comparison to the 
input polynucleotides (i.e. starting point homologous polynucleo- 
tides) . 

15 "Homology of DMA sequences or polynucleotides" In the pres- 

ent context the degree of DNA sequence homology is determined as 
the cegree of identity between two sequences indicating a deriva- 
tion cf the first sequence from the second. The homology may 
suitabiv be determined bv means of computer programs known in the 
20 art, such as GA? provided m the GCG program package (Program 
■-'a-ual for the Wisconsin Packace, Version 6, August 1994, Genet- 
ics Computer Grouo, 575 Science Drive, Madison, Wisconsin, USA 
52711) (Needleman, S.E. and Wunsch, CD., (1970), Journal of Mo- 
lecular 3iclogy, 45, 443-452). 
25 "Homologous": The term "homologous" means that one single- 

stranded nucleic acid sequence may hybridize to a complementary 
sincie-stranded nucleic acid sequence. The degree of hybridiza- 
tion may depend on a number c: factors including the amount of 
ic-ntitv between the sequences and the hybridization conditions 
30 such as temoerature and sell concentration as ciscussed later 



Using the computer program GA? {vide supra) with the fol- 
lowing settings for DNA sequence comparison: GAP creation penalty 
of 5.0 and GAP extension penalty of 0.3, it is in the press- 
context believed that two DNA sequences will be able to hybridize 
5 (using low stringency hybridization conditions as defined below) 
if they mutually exhibit a degree of identity preferably of at 
least 70%, more preferably at least 80%, and even more preferably 
at least 85%. 

"heterologous": If two or more DNA sequences mutually e:*:- 
10 hibit a degree of identity which is less than above specified, 
they are in the present context said to be "heterologous". 

"Hybridization:" Suitable experimental conditions r = r 
determining if two or more DNA sequences of interest do hybridize 
or not is herein defined as hybridization at low stringency as 
15 described in detail below. 

Molecules to which the oligonucleotide probe hybridizes 
under these conditions are detected using a x-ray film or a phos- 
phoimager . 

"primer": The term "primer" used herein especially in con- 
2Z nectien with a PGR readier: is an oligonucleotide (especially a 
" "--orim.er" ) define: azz zz r.s t rue ted according to general star.- 
dare soecif ications -:ncwr. m the art ("PGR A practical approach" 
IR1 Press, (1591) ) . 

"A primer cirectec ;c a sequence:" The term "a primer ci- 
25 reeled to a sequence" means that the primer (preferably to he 
usee in a PGR reaction) is constructed to exhibit at least 2 r j* 
decree of sequence identity to the sequence fragment of interest, 
more preferably at least rZl degree cf sequence identity to the 
secuer.ee fragment cf interest, wnich said primer consequently is 
30 "ejected to". The crimer i zesigned to specifically anneal at 
tne seouenre fragment rec::n it is cirected towards at a given 
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temperature. Especially identity at the 3' end or the printer _ is 
essential . 

"Flanking" The terr. "flanking" used herein in connection 
with DNA sequences comprised in a PCR-f ragment means the outer- 
most partial sequences of the PCR-f ragment, both in the 5" and 3" 
ends of the PCR fragment. 

"Polypeptide" Polymers of amino acids sometimes referred 
to as proteins. The sequence of amino acids determines the 
folded conformation that the polypeptide assumes, and this in 
turn determines biological properties and activity. Some 
polypeptides consist of a single polypeptide chain (monomer ic ) , 
whereas other comprise several associated polypeptides 
(multimenc) . All enzym.es and antibodies are polypeptides. 

"Enzvme" A protein capaole of catalysing chemical reac- 
tions. Specific types of er.zyr.es to be mentioned are such as 
amylases, proteases, carbor.ydrases, lipases, cellulases, oxi- 
doreouctases, esterases, etc. 0: specific interest in relation 
to the present invention are enzymes used in detergents, such as 
proteases, lipases, cellulases, amylases, etc. 



DETAILED DESCRIPTION 0: TK:, INVENTION 

The oreser.t invent icr. relates m its first aspect to a 
method for the construction of a library of recomimed homologous 
25 colvnucieotides from a number of different starting single or 
double stranded parental DNA templates and primers by induced 
t»moiate shifts curing an ir. vitro polynucleotide synthesis using 
a do i vrr.e rase , wnereDy 

A. extended primers or polynucleotides are synthesized by 
30 . a) denaturing a parent:.:, double stranded DNA template to 

croduce single s;:ar.:t: ~ expiates, 
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b) annealing said primers to the single stranded DNA te: 
plates, 

c) extending said printers by initiating synthesis by use 
said polymerase, 

5 d) cause arrest of the synthesis, and 

e) denaturing the double strand to separate the extenc 
primers from the templates, 

B. a template shift is induced by 

a) isolating the newly synthesized single stranded extenc 
10 primers from the templates and repeating steps A.b) 

A.e) using said extended primers produced in (A) as be 
primers and templates, or 

b) repeating steps A.b) to A.e), 

C. the above process is terminated after an appropriate numx 
15 0 f cycles of process steps A. and 3. a), A. and 3.b), or cc 

binations thereof, and 

D. cotionally the produced polynucleotides are amplified in 
standard PCR reaction with specific primers to selective 
amplify homologous polynucleotides of interest. 

2 0 

i"o^ci" to the invention the polymerase m.ay be a DNA or 
F.NA relvmerase, s:e::::c polymerases to be mentioned are such 
Ci;A polymerases like 74 polymerase, 77 polymerase , Z. coil Z 
polymerase I or tne Kiencw fragment of DMA polymerase I, or the 
25 mostabie polymerases such as 7ac, Amplitaq, Vent, Pwo . 

One of the advantages of the invention is that it makes 
possible to control the length of the extension of the primers 
the reaction in a convenient manner. 

This can be acccrplis-: by various means such as choice 
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tion of the polymerase , e.g. pH, temperature, buffer, salt con- 
centration, and addition of various chemicals. 

It is known that various polymerases carry out the DNA syn- 
thesis at different rates (nucleotides incorporated pr. second) . 
5 For example has the Kienow fragment of DNA polymerase I a limited 
extension rate compared to e.g. the Taq polymerase (Sambrook et 
al. 19S9) . 

Polymerases also display differences in processivi ty, which 
is the average number of nucleotides incorporated before the po- 
10 lymerase dissociates from the template/extended primer; again the 
Kienow polymerase is an example of a polymerase with limited 
processivi ty . 

Tne choice of polymerase is therefore an important means in 
controlling the average extension of the primers. 
15 These conditions may also exert an influence on the fidel- 

ity of the polymerase (the rate by which point mutations are in- 
troduced; HIV reverse transcriptase is an example of a polymerase 
c: low fidelity) , a parameter useful in combining shuffling and 
mutagenesis . 

— 1 7. specific embodiment s various modifications can be made 

~r.e orocess of tne invention. Tor example it is advantageous 
— acolv a defective polymerase either an error-prone polymerase 
i- ' nt reduce mutations in comparison to the templates, or a po- 
*'~~^--s o that will discontinue tne polynucleotide synthesis pre- 

25 maturely to effect tne arrest c: the reaction. Such a defective 
polymerases that could be mentioned is a Kienow polymerase having 
low process ivity . 

In another embodiment the invention polymerase will be 

added after each cvcie, 1: ■ polymerase used is not thermo- 



p 
en: 
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According to the invention the starting single or double 
stranded parental templates nay be different in that they contain 
different point nutations in the sane native polynucleotide 
(gene), or they can be homologous polynucleotides (genes) iso- 
5 lated fron nature, which nay be anplified by PCR, or they can be 
combinations thereof. The templates used in the process of the 
invention are hereby homologous showing an identity at the DN.-. 
level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 6Ch, 
55%, or even more than 50% identity. 
10 it may be advantageous to use pre-selected templates ccz- 

ising mutations with improved properties of interest. The pres- 
recombination method of the invention will then recombir.e 
said improved mutations for subsequent screening for even further 
improvements in the properties of interest. 

Said pre-selected templates with improved properties of in- 
terest may have been identified by standard procedures in the art 
comprising e.g. i) error-pror.e ?CR of templates of interest fol- 
lowed by ii) screening/selection for templates with improved 
characteristic of interest. The mutagenesis frequency (low or 
high mutagenesis frequency: cf tr.e error-prone PCR step is pref- 
erably adjusted in relation tc the subsequent screening capacity, 
he screening capacity is limited the error-prone ?CR 
is oreferabiy lew [i.e. cne to two mutations in each 
temoiate) (see WO 92/1S5-; 5 for further details). 

The arrest of the polymerase reaction in step A.d) may as 
indicated above be obtained in different ways, such as by raising 
the temperature, or adding specific reagents as described ir. Y.0 
9 5/17 413. 

Wnen raising the temp-:. r..t ure for this purpose, it is pre- 

ferrec to use temperatures :. • * • •• • • ar - z 



- - ] 
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When using chemical agents DMSO is a possibility. Appropri- 
ate procedures are mentioned in e.g. WO 95/17413. 

The process of the invention uses annealing of the primers 
to the templates in step l.b. In this context the annealing may 
be random or specific, meaning either anywhere on the polynucleo- 
tide or at a specific position depending on the nature of the 
primer . 

Also, the primers to be used may be completely random prim- 
ers ( NNNNNNNNNNN ) (N meaning a mixture of the four bases (A, 7, 
G , C) is used at a particular position in the primer during syn- 
thesis), semi-random primers, or specific primers. 

If the extended primers produced are to be separated from 
the primers during the process it is convenient to use labeled 
templates in order to provide a simple means for separation, a 
preferred marker is biotin or cigoxigenin. 

According to the invention the number of cycles necessary 
will be less than 500, in most cases less than 200, and normally 
less than 100 cycles. 

In an embodiment cf the invention the above in vitro shu:- 
flmc is combined with a subsequent in vivc shuffling by methods 



such as those described i h V.' 3 9 7 / 0 "7 2 



n : 



In its second asoect the invention relates to a method o: 
i c-r ~ ; f vi :.: oolvoeotides exhibiting imcrcvec properties m com- 
parison to naturally occurring polypeptides cf the same noactiv- 
itv, whereby a library of recomoined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then transformed into a suitable host system, 
to oe exoressec into the corresponding polypeptides, said 
Doivoeotices are then screened ;r. a suitable assay, and positive 
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In an embodiment it is contemplated that the polypeptides 
of interest encoded by the shuffled library are expressed as a 
suitable fusion protein (e.g. as a hybrid with gill of bac- 
teriphage M13/fd) in order to display said recombined polypeptide 
5 on the surface of phage or bacteria. 

In a third aspect the invention relates to a rr.ethod fcr 
producing a polypeptide of interest as identified in the preced- 
ing process, whereby a vector comprising a polynucleotide encod- 
ing said polypeptide is transformed into a suitable host, saif 
10 host is grown to express said polypeptide, and the polypeptice 
recovered and purified. 

The use of partial random (semi-random) or completely ran- 
dom primers (mixtures of bases in a selected number or all posi- 
tions in the primer) as initiation point for the DNA synthesis 
15 provide certain novel possibilities for the combined use of shuf- 
fling and random mutagenesis. 

I t is often associated with difficulties to obtain an ir. 
vitro recombination of polynucleotides that display relatively 
limited homology. 3y the use cf an embodiment of the invention 
2C ever, very diverse polynucleotides can be forced to recombme . 

According to that emocdiment at least two templates (or tw: 
cccl5 o: diverse templates ; are applied. The novel synthesis c: 
the one polynucleotide can tnen ice based on only one strand (i.e. 
either the sense or the anti-sense strand) , and the synthesis c: 
25 tne other polynucleotide is oased the opposite strand. 

This can be accomplished by isolating the complementary 
strands from tne two templates, e.g. by having these strands la- 
beled by biotm. Synthesis c: is initiated by annealing ei- 
ther specific, partly rar.cc:: :r completely random primers :: 
30 these templates and addinc ruitable polymerase. This can re 
performed as either separate reactions for the different tern- 
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plates or in just one reaction. Synthesis should preferably be 
performed under conditions that favor production of relatively 
short new fragments. These fragments can subsequently be isolated 
from the templates based on the affinity label. A PCR reaction is 
5 carried out on these fragments and as the starting material 
originates from two different strands, the newly synthesized 
fragments must recombine in order to produce full length PCR 
products - a kind of forced recombination. 

Also for this embodiment rapid PCR with short or no exten- 
10 sion time can be applied advantageously in order to enhance re- 
combination, especially if pools of templates are used for the 
two strands . 

The length 'of the primer and the annealing temperature 
utilized in the' process determines if random primers will anneal 

15 and the number of mismatches between the template and the primer 
that can be accormoda ted . By varying the primer length and the 
annealinq temperature the method of the invention provides a 
means for achieving rancor, mutagenesis within a certain nucleo- 
tide wmdov; representing the length of the primer. The method of 

20 the invention thereby provides substantial benefits compared tc 
other random mutacenesis approaches, especially the high prob- 
*;- v *r or several base substitutions close to each other in the 
srimarv seauence, e.c. the use of a completely random 20'mer 
(mixture of all four nucleotides in ail 20 positions) will ac- 

25 ccrdina to theory under given experimental conditions give a cer- 
tain reasonably high probability for having several base substi- 
tutions close to each other. 

Error prone PCR (=high mutagenesis frequency PCR) does not 
provide this possibility. Error prone PCR provides a very low 

2Q o^c'^b" lit v for havinc mere than one base substitution within one 
couon (ccdino for one ammo acid m a translated polypeptide) . 
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Obviously the substitution of only one base within a code: 
doesn't provide total random mutagenesis (at protein level) as 
only a limited set of amino acid substitutions can be obtained b\ 
one base substitution at DNA level (e.g. Methionine encoded by 
ATG-codon requires three base substitution to become the TGT or 
TGG-codon encoding Cysteine) . 

In one embodiment of the invention the process is therefore 
performed by using random or semi-random primers having a length 
of from 6 to 200 bp, preferably from 10 to 70 bp, and better fron 
15 to 40 bp. 

One of the advantages in the method of the invention is the 
robustness. In some embodiments the constant presence of full 
length template provides a further advantage avoiding PGR con- 
tamination problems. Furthermore it is much less laborius, with 
less hands on, than other described methods, thereby providing 
excellent possibilities for au t : ~.a t ion . 



?CR-pr imers : 

The PGR primers are :::,::;:uc:ed according to the standarc 
descriptions m the art. Normally they are 10-75 base-pairs (bp] 
long. However, for the specific embodiment using random cr semi- 
rancor, primers the ier.rt:". r .: v be substantially longer as indi- 
cate: above. 



?GR-react ions : 

If not otherwise mentioned the ?CR-reaction performed ac- 
cording to the invention are performed according to standard pro- 
tocols known in tne art. 

The term. "'Isolation : ; fragment" is intended tc cover 

as broad as simply an alloc:- . r.taming tne PGR fragment. How- 
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ever preferably the PCR fragment is isolated to an extend which 
remove surplus of primers, nucleotides templates etc.. 

In an embodiment of the invention the DNA fragment (s) 
is (are) prepared under conditions resulting in a low, medium or 
high random mutagenesis frequency. 

To obtain low mutagenesis frequency the DNA sequence (s) 
(comprising the DNA fragment (s)) may be prepared by a standard 
PCR amplification method (US 4,683,202 or Saiki et al., (1988), 
Science 239, 487 - 491) . 

A medium or high mutagenesis frequency may be obtained by 
performing the PCR amplification under conditions which increase 
the mis incorporation of nucleotides, for instance as described by 
Deshler, (1992), GATA 9(4), 103-105; Leung et al., (1989), Tech- 
nique, Vol. 1, No. 1, 11-15. 

It is also contemplated according to the invention to com- 
bine the PCR amplification (i.e. according to this embodiment 
also DNA fracment mutation) with a mutagenesis step using a suit- 
able physical cr chemical mutagen! c inc agent, e.g., one which in- 
duces transitions, t rans ve rs ions , inversions, scrambling, dele- 
tions, and/or insertions. 



:■: o r e s s i n c the r e c cmb i n a n t c rote in, from the recombinant shuffl 



sequences 

Expression of the recombinant protein encoded by the shuf- 
fled seauence in accordance with the second and third aspect of 
the oresent invention ma v be performed by use of standard expres- 
sion vectors and corresponding expression systems known in the 
art . 
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In the context of the present invention the tern "positi 
polypeptide variants" means resulting polypeptide variants po 
sessing functional properties which has been improved in compar 
son to the polypeptides producible from the corresponding inp- 
DMA sequences. Examples, of such improved properties can be 
different as e.g. enhance or lowered biological activity, i: 
creased wash performance, thermostability, oxidation stabilit; 
substrate specificity, antibiotic resistance etc. 

Consequently, the screening method to be used for icentif; 
ing positive variants depend on which property of the polypepti 
in question it is desired to change, and in what direction t: 
change is desired. 

A number of suitable screening or selection systems 
screen or select for a desired oiologicai activity are cescrib 
in the art. Examples are: 

Strauberg et al. (Biotechnology 13: 669-673 (1995) c 
scribes a screening system for subtiiisin variants having Ca 
c ium- mceoendent stability; 

(Proteins 1:326-33^ (1966)) describes 

enhanced thermal stabi 



: y a n e t a i . 
\z a s s a v for 



;es navm: 



an 



c-~--;jr;9c/00322 cestrioes a screening assay ror 
improved wash oerfcrmar.ee in washing detergents. 

An embodiment of the invention comprises screening or s 
ion of recombinant protein (s), wherein the desired biologic 
.vitv is oerformance in dish-wash or laundry detergents. Ex a. 



.es cf suitable dis^ 



■sr. 



.aur.drv detergents are disclose: 
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ance, the screening nay conveniently be performed by use of a 
filter assay based on the following principle: 

The recombination host cell is incubated on a suitable me- 
dium and under suitable conditions for the enzyme to be secreted, 
5 the medium being provided with a double filter comprising a first 
protein-binding filter and on top of that a second filter exhib- 
iting a low protein binding capability. The recombination host 
cell is located on the second filter. Subsequent to the incuba- 
tion, the first filter comprising the enzyme secreted from the 
10 recombination host cell is separated from the second filter com- 
prising said cells. The first filter is subjected to screening 
for the desired enzymatic activity and the corresponding micro- 
bial colonies present on the second filter are identified. 

The filter used for binding the enzymatic activity may be 
15 any protein binding filter e.g. nylon or nitrocellulose. The top- 
filter carrying the colonies of the expression organism may be 
anv filter that has no or low affinity for binding proteins e.g. 
cellulose acetate or Durapore^. The filter may be pre-treated 
with anv of the conditions to be used for screening or may be 
o *- v- ^ r. *- o ~ c u r n c the detection or e n z vm. a t i c activity. 

-tic aztivitv mav be detected by a dye, fluores- 
cence, orecipitation, pH indicator, I R-absorbance or any other 
known technicue for detection of enzymatic activity. 

^■ r . 0 detect mo ccmoounu mav be immobilized by any immobiiiz- 
25 ing agent e.g. agarose, agar, gelatin, polyacryiamice , starch, 
filter pacer, cloth; or any combination of immobilizing agents. 

If the imoroved functional property of the polypeptide is 
not sufficiently good after one cycle of shuffling, the polypep- 
tide mav be subjected to another c y c i e . 
30 - in an embodiment cf the invention wherein homologous 

col ^nucleotides reoresent m:; a number cf mutations of the same 
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gene is used as templates at least one shuffling cycle is a back- 
crossing cycle with the initially used DNA fragment, which may he 
the wild-type DNA fragment. This eliminates non-essential muta- 
tions. Non-essential nutations may also be eliminated by using 
5 wild-type DNA fragments as the initially used input DNA material. 

Also contemplated to be within the invention is polypep- 
tides having biological activity such as insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, pitu- 
ary hormones, somatomedin, erythropoietin, luteinizing hormone, 

10 chorionic gonadotropin, hypothalamic releasing factors, antidiu- 
retic hormones, thyroid stimulating hormone, relaxin, interferon, 
thrombopoeitin (TPO) and prolactin. 

A requirement to the starting parental DNA sequences, en- 
coding the polypeptide (s) , to be shuffled, is that they ar.e £. 

15 least 50%, 60%, 70s, 801, 90S , or 951 homologous. DNA sequences 
beir.c less homologous will have less inclination to interact and 
recombine . 

It is also contemplated according to the invention to shuf- 
fle parental polynucleotides that are homologous as indicated 

2: a::ve originating frcm v.-ild type organisms of different genera. 

further, the star-mo oarental templates to be shuffled may 
preferably have a lengtn cf from about 50 b? to 20 kb, preferably 
about ICO bp to 1C >:b, more preferred about 200 bp to " kb, espe- 
cially about 400 bp to 2 !-:c . 

2 5 The starting parental DNA sequences may be any DNA se- 

quences including wild-type DNA sequences, DNA sequences encoding 
variants or mutants, or modifications thereof, such as extended 
or elongated DNA sequences, and may also be the outcome of DNA 
sequences having been suo;ecteu to cne or more cycles of shuffi- 

50 ir.o (i.e. output DNA sequent-.-. according to the method of the 
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invention or any other method (e.g. any of the methods described 
in the prior art section) . 

When using the method of the invention the resulting recor.- 
bined homologous polynucleotides (i.e. shuffled DNA sequences), 
5 have had a number of nucleotide fragments exchanged. This results 
in replacement of at least one amino acid within the polypeptide 
variant, if comparing it with the parent polypeptide. It is to be 
understood that also silent exchanges are contemplated (i.e. nu- 
cleotide exchange which does not result in changes in the amino 
10 acid sequence) . 

MATERIALS AND METHODS 

Specific Method Used in the Examples: 

A) DNA encoding different enzyme variants of the same gene or 
15 different enzymes having the same type of activity encoded by 
homologous genes are mixed. Tr.o DNA is provided as either PGR 
fragments, plasmid, phage or cer.cmic DNA. 

3) The resulting pool of EN A is mixed with DNA Polymerase, 
dN7?, a suitable buffer ar.c primers (being either random oli- 

22 earners (length of 6-3 0 nucleotides) or specific oligomers 
{length cf 6-50 nucleotide-; or a combination of both types). 
C ) The PGR mixture i c put into a PGR t he rmocycie r (either 
cole or hot) in a suitable tube. 

cl) The t he rm.oc y c 1 e r is heated to a temperature of 80- 

25 100'C for a period of time (typically 1-10 min) in order to dena- 
ture the DNA templates. 

c2) Thereafter the fcllcwing procedure (cycle) is fol- 
lowed (repeated): The template is denatured (typically 90-100°C 
for 0-5 minutes) . Then the t-:: - : nature is lowered (typically tc 

30 a value between 10°G and 90 |C C :;: 0-5 minutes) to allow annealing 
of the primer to the single stranded template. Nov; the tempera- 
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ture is raised again to denaturat ion temperature (90-100°C) al 
lowing small extension of the primer to be synthesized by th 
DNA polymerase during ramping. Alternatively a short extensic 
period (typically 0-30 seconds at 70-75°C) can be introduced z 
5 allow larger extensions of the primers to be generated. When tr. 
temperature reaches a value where denaturation takes place, tr. 
extended primers and templates are again separated. This proce 
dure can be repeated (typically between 1 to 99 cycles). 
D) Having performed the desired number of cycles the gener 

10 ated small DNA polymers can be purified from the oligomers use 
as primers. One way is to isolate and clone a specific amplifie 
band containing the gene codir. g for the polypeptide of inter ez 
into a suitable vector. This can be done either on agarose ge 
(tvpicallv isolating fragments between 50 to 1000 base pairs; 

15 by beads (using an affinity label on either templates or p Ti- 
ers ) or through columns. 

Z) Then the purified (or the not purified) DNA polymers C: 
be assertjoled in a standard FCR reaction (for instance 94°C, 
mi — tes, (94°C, 30 sec; 55 s :, 2C sec; 72°:, 2 mm)- 2 5, 72°C 



Srecific o rimers ■;: DNA cclvmers generated dv speci:; 

orimers can be adciec in crcer to generate a specific DNA polym.e 

cent a in in c the cene cf interest. This As mentioned in point D 

this DNA ooivmer can be purified ana cloned into a vector of in. 
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Method 1 

The strong advantage of the method exemplified here is the 
robustness and lack of PCR contamination problems, due to the 
constant presence of parental template. Furthermore this method 
is less labor demanding than methods described in the prior art, 
thereby providing excellent possibilities for automation. 

Nine different plasmics containing DNA sequences encoding 
9 different variants of the H . lanuginosa lipase gene, were 
mixed in equimolar amounts. The variant genes contained from two 
to seven nutations scattered throughout the gene. 

The DNA sequence of the X. lanuginosa lipase gene and the 
ammo acid sequence of the lipase are disclosed in EP 0 305 216 

The variants are indicated according to standard termmoi- 



as describee 



e.c. EP C 



603, and WO 97/07202 



Lar. 



shuffle 



1 ' . N94K-HD96L + E99K 

2 ' . S???.?.?-H-'S4r'^D9S! 

3 . . s??=f.?+.:..:9;>;:3£. : 

4 ' . ST??.?.?tN9-;?. 



2 2 1 P.- N 2 3 3 22 3 4 P-Q2 4 9 P. 



25 



SPPRP.P+D57G-N9 4 K-D9 5L + Q2 • 



The following components 
2 ui o la sr. id mixtu 



xed in a microtube: 

uc/ui;, specific primers fiar. 
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10- tac buffer (Perkm Elmer), 0.5 pi taq enzyme in a total vol- 
ume of 20 (il. 

The tube was set in a Perkin Elmer 2400 thermocycler . Ths 
following FCR-progran was run: (94°C, 5 minutes) 1 cycle: 
(94°C , 30 seconds, 1C°C, 0 seconds) 99 cycles(72°C / 2 minutes, 

4°C indefinite) 1 cycle 

The PCR-reaction was run on a 1.5 % agarose gel. A DNA- 
band of the specific expected size was cut out of the agarose 
gel and purified using JETsorb (from GENOMED Inc.). The purifie 
?GR-product was cloned into a TA-vector (from Invitrogen Un- 
original TA cloning kit). The iigated product was transform- 
into a standard Escherichia coli strain (DH5a) . 

20 transformants where fully sequenced across the gene o 
interest. 

Result : 

The following 20 variants were found: 

1 . D:3 7G + D167G + E2107rY213C 

2, s???.R?tD57G-N9 4K09cL-Q2 4 9 R 

: . :;94R-r95i 

4 . S?r?.?.?Oi3 7G-;jl£7G~E: 1CV 

: . ::94:>D9cL^E99:*^VlE7A^T257: 

£ . 1127G-D157G-E2107 

7 . :;94:;+D9cl-E99:>V137A 

E . D57G-i-N94R-rF95L^Q2 4 9R 

9 . ::94?;rD95L-E99K-E21QY 
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14 . SPPRRP+Q15P+A19C+C36A+N94K+D96L 

15. SPPRRP+N9 4K+D9 6L+T2 31R+N2 33R+D2 3 4R+Q2 4 9R 

16. D137G+D167G+E210V 

17 . SCIRR+N94K+D96L+Q2 4 9R 

18 . N94K+D96L+E99K 

19. N94R+F95L 

20 . SPPRRP+N9 4R+F95L+F113S+Q2 4 9R 

Nearly all mutations where represented (19 of 20) indicat 

ing little bias for specific templates. 



Statistics: 


Not shuffled 




10 


Shuffled between at 


least 2 templates 


8 


Shuffled between at 


least 3 templates 


2 



The shuffled sequences can then be subcloned from the E.coli T 
vector into the yeast vector pJS026 as a BamHI-Xbal fragmen 
(see WO 97/07205), and e.g. screened for new shuffled sequence 
v; ; —proved oerformar.ee in detercents (see WO 97/07205) . 

Method 2 : 

PGR products of 10 different lipase variant genes wer 
generated as above and pooled m equimolar amounts. 

The following 10 mutant genes were shuffled. 

1 ' . D137G + D167G + E210V 

2 1 . D96L+Z99K+V187A 
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4 ' 
5' 
6' 

7 ' 

8 * 
9' 
10 



SPPRRP+D57G+N94K+D3 61tQ2 4 9R 
D111N+F211A+G225P 

SPPRRP+N94K+D9 6L+T2 31R+N23 3R+D2 34R+Q24 9R 
SPPRRP+A19C+C3 6A+N9 4K+D9 6L+Q2 4 9R 
STPRRP+N94R 
N94R+F95L 

SCIRR+N94K+D9 6L+E2 3SC+Q2 4 9R 



The following mixture was generated in a suitable tube: 
10 1 |ul PCR mixture (0.1 ^g) , decamer random prir.er (3CI 

pmoi), 2 yil 10* Klenow buffer (Promega), 0.25 mM dNTP, 2.5 mM 



MgCi, in a total volume of 20 ul . 

The mixture was set ;r, a 
following program was rur.: 9C C 0, 
ml Klenow enzpe was added/ 2: c 2 

This procedure generate: a 
mers originating from ail parts : 

10 jul was taken ou 



PEI24 00 the rmocycle r where the 
5 minutes , 25°C 5 minutes, 0.5 
10 minutes, 35°C 90 minutes, 
high number of small DNA poly- 
oar t 5 of the gene 
::: test on agarose gel. 



seconds/ 45°C, 30 seconds, 
minutes, 4°C indefinite. 

T n e PCR products wet: 
unciasec smear was seen. 



mX cNTP, 1 ja 1 10* Tag buffer 
0.5 (ul Tag enzyme) was added to 
thermocycier . Titer, the following 
94 0 C, 5 minutes) 1 cycle, (94°C 2Z 
Z 30 seconds) 25 cycles, 72°C 7 

t cn a 1.5% agarose gel. A clear 
tween 400 and 800 bp was isolated 
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0.25 mM dNTP, 2 |il 10* Tac buffer, 2.5 mM MgCl 2 . Then the fol- 
lowing standard PCR-program was run: (94°C , 5 minutes) 1 cycle, 
(94°C 30 seconds, 50°C, 30 seconds, 72°C 30 seconds) 25 cycles, 
72°C 7 minutes, 4°C indefinite. 

The PCR product was run on a 1.5% agarose gel. A specific 
though weak band of the expected size was isolated. Additional 
PCR was run using specific primers (as mentioned above) in order 
to amplify the PCR-product before cloning. 



l at 



:R-product and the desired vector were cut with the 



appropriate restriction enzymes (3amJ-iI/XhoI ) . The vector and the 



agarose gel, and purine: 



PCR product were run on a 1.; 
the gel. 

The cut PCR-product and the cut vector were mixed in a 1: 



case buffer with T4 DNA ligase ;?rcmega). After overmen: 



iga- 



:c l tr.: 



transformed into 



coli strain 



15 clones were fully sequenced across the gene, 
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10. N94R+Q249R 

11. D167G 

12. D167G 

13. T32I+N94R+F95L+D167G+Q249R 
5 14. E87K+N94K+D96L 

15. N94R+F95L+Q249R 

16. N94K+D96L+D111N 

17. STPRRP+S17T 

18. N94K+D96L+V187A 

10 19 . SPPRRP+D57G+N94K+D9 6L+D111N+L151S 

All template variants were represented indicating littlt 
bias for specif ic 'templates . 

There were no apparent hot spots with regard to mutatior 
15 exchange and it seems to be evenly distributed along the gene 



Statistics : 



Not shuffled 








1 


Snuffled 


between 


a t 


least 2 


templates 


10 


Snufflec 


between 


a t 


least 3 


templates 


6 


Snuff leu 


b e t w e e n 


a t 


least 4 


tern plates 






b e twee n 


a t 


least 5 


templates 


0 


Shuffled 


b e t w e e n 


a t 


least 6 


templates 





The shuffled sequences can then be subcioned from the 
E.ccli 7 A vector into the '/east vector pJS026 as a BamKI-Xbal 
20 fragment (see WO 97 /07205;, and e.c. screened for new shuffled 
secuences with improved performance m detergents (see WO 
97/07 205) . 
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Example 3: 

Amylase variant shuffling: 

5 In Example 1, it was shown how a number of multiple vari- 

ants of H. lanuginosa lipase were shuffled. In a similar manner, 
variants of Bacillus a-amylases can be shuffled. 

Earlier patent applications have identified variants of 
various a-anylases from Bacillus species improved for particular 
10 properties, e.g. thermostability, stability under Calciun- 
depleted conditions, improved wash-performance etc. (see 
WO95/10603, W096/23874, W096/23873, and PCT/DK97 /00 1 97 ) . 

Variants of B . licheniforr.is a-amylase amyL can be shuf- 
fled as follows. The variants are all located in the S. subtilis 
15 expression vector pDN1528 described in WO95/10603. 

The experiment is carried cut under the exact same condi- 
tions as Example 1 except that the flanking 27mer primers used 
to initiate DN'A synthesis were different. 

The ?CR amplified band of approximately 1500 bp is puri- 
2 0 fied from an agarose eel and cloned as described in Examplel. 

: ^ . . tive ^ v restriction sites located within the amyL cene car. 
utilized to clone the library of shuffled genes into either 



oe 



piasmic DL'NiD^ or an £. coli vector containing th< 



wild type amyL gene, e. 



JeENl described in W096/23874 



25 



Example 4 





Wu 98/41 65s 



t> *' m f nr\ rnOfftAini 



Shuffling of two genes encoding homologous a-amylases: 
amyL and the amylase identified by SEQ.ID no2 (amino acid) and 
SEQ.ID no. 5 ( DNA) described in W096/23873. 

5 The forward strand (identical to the mRNA) of amyL can be ampli- 
fied in a PCR using standard conditions. 

The forward strand is separated from the reverse strand 
based on its affinity to streptavidin coated magnetic beads and 
denaturation of the two strands with NaOH. 
10 Similarly the reverse strand (complementary to mRNA) of the amy- 
lase encoded by SEQ.ID no5 (W096/23873) can be amplified and 
isolated 

Two primer strands are used as templates in a PCR: (94°C 5 
minutes) + 99 cycles of (94°C, 30 seconds; 60°C, 0 seconds) +■ 
15 {72°C, 5 minutes) using random primers of various lengths, Taq 
polymerase and standard buffer conditions as described in Exam- 
ple 1. 



The resulting appro 



1500 bo product is cloned ei- 



as TA clonina as described in 



Examole one (for verification 



20 




clone 



or into Bacillus vector, 



utilizinc Sfil ar.c 



res 



triction 



sites . 



The oricinai temolaie can be removed from the PCR at any 



(e.g. after 5 , 10 



cvcies) based on the biotin tag). 
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PATENT CLAIMS 

1. a method for the construction of a library of recombined 
homologous polynucleotides from a number of different starting 
5 single or double stranded parental DNA templates and primers by 
induced template shifts during an in vitro polynucleotide syn- 
thesis using a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing a parental double stranded DNA template to 
produce single stranded templates, 

b) annealing said primers to the single stranded DNA 
templates, 

c) extending said primers by initiating synthesis by use 
of said polymerase, 

d) cause arrest of the synthesis, and 

e) cenaturing the double strand to separate the extended 
primers from the templates, 

3. a template shift is induced by 

a) isolating the newly synthesized single stranded ex- 
tended orimers frc.~ the templates and repeating steps 
A.b) to A.e) usir. :; said extended primers produced in 
{A} as both primer:: and templates, or 

b ) reoeatmc steps A.b) to A.e), 
C. the above orocess is terminated after an appropriate num- 
ber of cycles of process steps A. and 5. a), A. and 3.b), 
or combinations thereof, and 

coticnallv the produced polynucleotides are amplified in a 
standard ?CR reaction wit:, specific primers to selectively 
amolifv homologous poiynu 1 e r t ides of interest. 

2 The method of claim 1, wnerem said polymerase is a ther- 

mostable DNA polymerase, s^c- as Taq, Amplitaq, Vent, Pwo . 
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3. The method of claim 1 , wherein said polymerase is a DNA 
polymerase, such as T4 polymerase, T7 polymerase , E . coli DNA 
polymerase I or the Klenow fragment of DNA polymerase I, and 
said polymerase is added to the reaction mixture after each cy- 
cle . 

4. The method of claim 1, wherein said polymerase is a RNA 
polymerase . 

5. The method of any of the claims 1 to 4, wherein saic po- 
lymerase is an error-prone polymerase, such as a reverse tran- 
scriptases like HIV' reverse transcriptase. 

6. Tne method of any of the claims 1 to 5, wherein said 
starting single or double stranded parental templates are dif- 
ferent m that they contain different point mutations in the 
same native polynucleotide (gene), or are homologous polynu- 
cleotides (genes) isolatec from nature. 



e method of ciair z, wherein said starting single or 
.tranded parental templates exhibit an identity at the 
C ! i level of e.g. more t h a : 
6 0%, 5 5%, or even more than 



5%, S0%, 55%, 80%, 75%, 70%, 65%, 
;% identity. 



£. Tne me in co c: ^ny c: 
rest of the polymerase rt 
raisinc the temperature. 



s 1 to 7, wnerem tne ar- 
steo A.d) is achieved by 
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10. The method of any of the claims 1 to 7, wherein the ar- 
rest of the polymerase reaction in step l.d is achieved by add- 
ing specific reagents, such as DMSO. 

11. The method of any of the claims 1 to 10, wherein said 
primers comprise a population of completely random primers. 

12. The method of any of the claims 1 to 10, wherein said 
primers comprise a population of semi-random primers. 

13. The method of any of the . claims 1 to 10, wherein said 
primers comprise a population of specific primers. 

14. The method of any of the claims 1 to 13, wherein said 
onmers are labeled, preferably w;:h biotm or digoxigenm. 

15. The method of any of the claims 11, 12 or 14, wherein 
sa;c rancom cr semi-ranccm primers have a length of from 6 to 
5:: bo oreferablv from 15 to 305 bp, ar.d better from 25 to 150 



16. Tne method of any cf the claims 13 or 14, wherein said 
crimers are 6 to 75 bp lone. 

17, rr.ethod cf any of the claims 1 to 16, wherein said 
5 - p ~ parental DN- te~olates are at least 50$, 60*, 70%, 

80%, 30%, or S5°3 homologous. 

IB. .The method cf claim 1~\ wherein said starting parental 
D!3A -emolates oriainate from wild type organisms of different 
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19. The method of any of the claims 17 or 18, wherein said 
starting parental templates have a length of from about 50 bp 
to 20 kb, preferably about 100 bp to 10 kb, more preferred 
about 200 bp to 7 kb, especially about 400 bp to 2 kb 

20. The method of any of the claims 1 to 19, wherein said 
starting parental templates are linear DNA fragments generated 
by a PCR reaction. 

21. The method of any of the claims 1 to 20, wherein said 
starting parental templates are . cloned into suitable vectors, 

such as a piasmid. 

1 5 22. The method of any of the claims 1 to 21, wherein said 
starting parental templates represent a polynucleotide encoding 
an enzyme, such as a carbor.yi hydrolase, ca rbohydrase , or es- 
terase, such as a protease, lipase, amylase, cellulase, oxi- 
dase, oxido reductase, etc. 

„^v,^ a--.- cf the claims 1 to 21, wherein said 

startmc parental templates represent a polynucleotide encoding 
a polypeptide having biological activity, such as insulin, 
luca-on, somatostatin, somatotropin, thymosin, parathy- 



25 rcid hommone, pituary 



— c"°s , somatomedin, erythropoietin, 



1 „^; r ,.; n: honr.sr.e, chorionic gonadotropin, hypothalamic re- 

~~™~-o 5 thvrcic stimulating 
leasing ractors, an.id^ure .._...-..-:>, ^ 

- - -shjscoceitin (T?0) and prolac- 



^. ^ r-; <-■- — c - to 21, wherein saic 

24. Tne metnee o^ c .. ■ • ■ - 

fc - _„^ t; ^ c ,- ~ ~ * p colvnucleot ide contam- 

startmd oarenta^ ^.e.^-s^s - 
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ing a biological function such as transcription initiation 
(promoter) or termination, translational initiation, operator 
sites related to expression of genes 

5 25. The method of any of the claims 1 to 24, wherein said 
number of cycles are less than 500, 200, or less than 100 cy- 
cles . 

26. A method of identifying a polypeptide of interest exhib- 
10 iting improved properties in comparison to naturally occurring 

or other known polypeptides of the same activity, whereby a li- 
brary of recombined homologous .polynucleotides produced by a 
process according to any of the claims 1 to 25 are cloned into 
an appropriate vector, said vector is transformed into a suit- 
15 able host system, to be expressed into the corresponding 
polypeptides and displayed, and said polypeptide is screened in 
a suitable assay, and positive polypeptides selected. 

27. A method for producing a polypeptide of interest as iden- 
20 tified according to claim 25, whereby a vector comprising a 

polynucleotide encoding said identified polypeptide is trans- 
formed into a suitable host, said host is grown to express said 
polypeptide, and the polypeptide recovered and purified. 
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